A Minimal Rare Substructures-Based Model for Graph Database Indexing
نویسندگان
چکیده
Systems such as proteins, chemical compounds, and the Internet are stored as graph structures in graph databases. A basic, common problem in graph related applications is to find graph data that contains a query. It is not possible to scan the whole data in graph databases since subgraph isomorphism testing is an NP-complete problem. In recent years, some effective graphs indexes have been proposed to first obtain a candidate answer set and then performing verification on each candidate by checking subgraph isomorphism. However, candidate verification is still inevitable and expensive when the size of the candidate answer set is large. In this paper, we propose a new Structural Graph Indexing, called GIRAS, based on RAre subGraphs (RGs) as the basic indexing feature. The idea is to have a single characteristic that can uniquely identify a graph in a database. Few substructures are ideal candidates since they are rare graphs, which means they occurs in only a small number of graphs in the database. Thus, in confronting a query using these indexes, the size of the candidate answer set is close to that of the exact answer set, and the number of subgraph isomorphism tests is small. Therefore, the time of the candidate verification step is reduced to a minimum.
منابع مشابه
Kernel-based Similarity Search in Massive Graph Databases with Wavelet Trees
Similarity search in databases of labeled graphs is a fundamental task in managing graph data such as XML, chemical compounds and social networks. Typically, a graph is decomposed to a set of substructures (e.g., paths, trees and subgraphs) and a similarity measure is defined via the number of common substructures. Using the representation, graphs can be stored in a document database by regardi...
متن کاملمدل دو مرحله ای شکاف- گلچین برای نمایه سازی خودکار متون فارسی
Purpose: Each language has its own problems. This leads to consider appropriate models for automatic indexing of every language. These models should concern the exhaustificity and specificity of indexing. This paper aims at introduction and evaluation of a model which is suited for Persian automatic indexing. This model suggests to break the text into the particles of candidate terms and to c...
متن کاملمیزان انطباق الزامات ساختاری مجلات علوم پزشکی کشور ایران با معیارهای نمایهسازی اسکوپوس
Background and Aim: In the recent years the number of science research health journals has increased in Iran. These journals should be based on the standards and criteria required in international indexing database. The aim of this study was to determine the adaptation rate of structural requirements on the Iranian medical journals with the criteria of indexing based on Scopus indexing database...
متن کاملGraph-based Visual Saliency Model using Background Color
Visual saliency is a cognitive psychology concept that makes some stimuli of a scene stand out relative to their neighbors and attract our attention. Computing visual saliency is a topic of recent interest. Here, we propose a graph-based method for saliency detection, which contains three stages: pre-processing, initial saliency detection and final saliency detection. The initial saliency map i...
متن کاملA Hypergraph-Based Model for Graph Clustering: Application to Image Indexing
In this paper, we introduce a prototype-based clustering algorithm dealing with graphs. We propose a hypergraph-based model for graph data sets by allowing clusters overlapping. More precisely, in this representation one graph can be assigned to more than one cluster. Using the concept of the graph median and a given threshold, the proposed algorithm detects automatically the number of classes ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016